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ABSTRACT 

A comprehensive search for variable and transient radio sources has been 
conducted using ~ 55,000 snapshot images of the FIRST survey. We present an 
analysis leading to the discovery of 1,627 variable and transient objects down to 
mJy levels over a wide range of timescales (few minutes to years). Variations 
observed range from 20% to a factor of 25. Multi-wavelength matching for coun- 
terparts reveals the diverse classes of objects exhibiting variability, ranging from 
nearby stars and pulsars to galaxies and distant quasars. Interestingly, more 
than half of the objects in the sample have either no classified counterparts or 
no corresponding sources at any other wavelength and require mult i- wavelength 
follow-up observations. We discuss these classes of variables and speculate on 
the identity of objects that lack multi- wavelength counterparts. 

Subject headings: catalogs — methods: data analysis, statistical — radio continuum: 
general — surveys 
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Introduction 



The first celestial 



of its variability (JHey 



radio source detected - tlie Sun - was discovered as a consequence 



19461). Nonetlieless, while variabihty has continued to be a source of 
discovery and insight in the radio regime, few dedicated searches for variables and transients 
have been undertaken, primarily because the observing times required to conduct deep 
wide-field surveys are large and curren t instruments tend to have poor figures of merit for 



variability studies (iCordes et al. 



20041 ) ■ 



Indeed, one of the key scientific aims of the next generation of radio telescopes such 
as LOFAR, ASKAP, MWA and , eventually, the SKA is a study of radio variability. As 
discussed by 



Hessels et al. 



(|2009[ ). LOFAR will survey the sky for pulsars and fast transients 
at low frequencies (30-240 MHz) with an emphasis on timescales of less than a second. CVs, 
X-ray binaries, GRBs, SNe, AGN, fiare stars, exoplanets and many more new phenomena, 
as well as extrinsic causes such as interstellar and interplanetar y scintillation, are e xpected 



to contribute to its inventory of radio transients and variab 



Australian SKA Pathfinder (ASKAP; 



Johnston et al. 



2007 



es (JFender et al 



20081 ). The 



20081 ) is designed to achieve 



very high survey speeds and noise levels down to ~ 10 — 100 /xJy in an hour in the 1 GHz 
band, and will shed light on GRBs, radio supernovae (RSNe), Intra-Day Variables (IDVs), 
etc. The Murchison Widefield Array (MWA) will operate in the 80-300 MHz range carrying 
out a blind search for variables and transients in addition to targeting explosive events. 



stellar and planetary phenomena, and corn pact objects o ver a range o f timescales 



nanoseconds to years (JLonsdale et al 



20091 ). The EVLA flNapier 



2006 : 



Rupen 



rom 



20001 ) will 



offer vastly improved sensitivity along with dynamic scheduling, providing a host of new 
capabilities for transient and variable searches. Finally, a search for radio variables and 
transients also forms an important part of the scientific objectives of the Square Kilometer 
Array (SKA). The SKA is expected to discover a number of classes of variable radio 



-4- 



objects such as pulsars and magnetars, GRBs that are 7-ray loud and 7-ray quiet in both 



afterglow and prompt emission, sub-stellar objects such as brown dwar 



microquasars, and potentially new classes of astrophysical phenomena (ICordes et al. 



s and exoplanets, 



Cordes 



2008 



Wilkinson et al. 



20041 ) ■ 



2004 : 



Most of the research on radio variability to date has focused on brig ht radio samples 



le.g., 
5 GHz 



Gregorini et a. 



AUer et al 



t9m, S > 0.4 Jy at 408 MHz; 



Lister et al. 



(120011 ). S > 0.4 Jy at 



(|2003[ ). 5 > 1.3 Jy at 5 GHz). At these flux densities, the radio source 



population is dominated by AGN, while at 



by st ar-forming galaxies ( jWindhorst et al. 



ainter flux densities 



1999 



' < 1 raJy), it is domin ated 



Richards et al 



1999 



Hopkins et al. 



2OOOI ). Research on the variability of faint radio sources has so far yielded small samples 
as a consequence o f the small areas searched and the long integration times required. 



Carilli et al 



(120031 1 observed the Lockman Hole region on timescales of 19 days and 17 
months down to 0.1 mJy at 1.4 GHz and discovered nine variable sources, providing an 
upper limit to the areal density of variable radio sources, as well as constraints on the 
beaming angle of GRBs and confusion limits relevant to searches for GRB afterglows. 
Archival VLA calibrator observat ions spanning 22 years with 944 independent epochs have 



been used by 



Bower et al. 



( I2OO7I ) to search for radio transients over a single field with 



half-power beam widths of 9'0 and 5'A at 5 GHz and 8.4 GHz, respectively. The typical 
observation's flux density threshold is 300 yuJy. They detect eight transients in single epochs 
and two transients in two-month averages of the data. Two of the transients were identified 
as RSNe, while the absence of optical counterparts for the remainder offers a wide variety 
of possibilities including Orphaned GRB Afterglows (OGRBA), stellar sources, propagation 
effects, microlensing events, or perhaps mechanisms heretofore unknown. 

In the Galaxy, single radio bursts detected from the Parkes Multibeam Pulsar Survey 
revealed a new population of neutron stars: Rotating Radio Anomalous Transients (RRATs; 
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2005 



McLaughlin et al.l 120061 ) . Galactic Center Radio Transients (GCRTs; iHyman et al.ll2002l . 



20091 ) have diverse light curves with outburst periods and burst durations varying 



from minutes to months. Lacking counterparts at other wavelengths, their hosts, and the 
physical mechanisms involved, remain a mystery. 



Two large, sens itive, wide- fie 



Sky Survey (NVSS; 



Condon et al 



d surv eys exist at radio wavelengths. The NRAO VLA 



19981 ) covered the entire sky north of -40°declination at 



1.4 GHz with a resolution of 45"and an r ms noise of . 45 m Jy. The Faint Images of the 



Radio Sky at Twenty-cm survey {FIRST; iBecker et al.lll995l ) has covered > 9,000 deg^ of 



the sky at 1.4 GHz with a uniform rms of 0.15 mJy and an angular resolution of 5'.'4. A 
number of authors have used the se two surveys in their overlap region to search for radio 



transients. 



Levinson et al. 



( I2OO2I ) tried t o constrain the detection rates of orphaned radio 



afterglows often associat ed with GRBs 



Gal- Yam et al. 



the sample generated by 



Levinson et al 



(12006[ ) attempted to characterize 



( I2OO2I ). They report the detection of a radio SN in 



a nearby galaxy and the detection of a source with no optical counterpart, concluding the 
latter is unlikely to be associated with a GRB. They also place tighter constraints on the 
beaming factor of GRBs and a limit on total rate of nearby relativistic explosions, implying 
that most core collapse SNe do not eject unconfined relativistic outfiows. These studies 
have been complicated by the mismatched resolutions and fiux density sensitivities of the 
NVSS and the FIRST surveys. 



de Vries et al 



fl2004[ ) have used ~ 120 deg^ of the FIRST survey data near 5 = 0°taken 
in 1995 and then repeated in 2002 to study the optical properties of sources that exhibit 
significant radio variability at 1.4 GHz over this seven-year interval. They find 123 variable 
objects with fiux densities ranging from ~ 2 - 1000 mJy. They conclude that there is a 
higher fraction of quasars in the sample of variables compared to the non-varying sample. 



More recently. 



OfeketaL 



( I2OIOI ) have raised a very interesting possibility that the 
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mysterious transients of iBower et al.l ( 120071 ) observed out of the plane of the Galaxy, based 
on their areal number density, duration and energy characteristics, could have a progenitor 



population consistent with being Gala ctic isolatec 



of the order of kp c. X-ray follow-up (jCroft et al. 



old neutron stars with mean distances 



Bower et al 



20 111 ) of the transients reported by 



( 120071 ) mostly resulted in non-detections and the X-ray flux upper limits imply 
consistency of the progenitor population with the above possibility besides being consistent 
with extreme flare stars at > 1 kpc or less extreme flares from brown dwarfs at distances of 
~ 100 pc. 

Here, we use the data from the FIRST survey to create the largest, unbiased sample of 
variable radio sources to date down to a sensitivity level of a few mJy. In §2.11 we describe 
the attributes of the FIRST survey relevant to a study of radio variability. In ^ we present 
our approach to extracting a sample of variables from the more than two million individual 
source observations contained in the database. We then describe {^ our refinement of 
the list of variable candidates, the parameterization of their variability, and a summary 
of their properties. In ^ ^ and ^ we present the results of our analysis, including the 
cross-identification of our sample of transients and variables with existing data at optical 
wavelengths (and in other spectral regimes), and a discussion of the classes of variables 
identified, as well as the new populations which might exist among the unidentified radio 
variables. 

2. The Data 



The FIRST survey, described below, is the most sensitive large-area radio survey in 
existence, and provides a database that is uniform in angular resolution and flux density 
sensitivity. Until the next generation of radio instruments arrives, it offers the opportunity 
to produce the largest unbiased survey yet undertaken for radio transients and variables. 
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We outline here the sahent characteristics of the survey. 



2.1. The FIRST Survey 

The FIRST survey covered ~ 9,055 deg^ (8,444 deg^ in the north Galactic cap 
and 611 deg^ in the south Galactic cap) with observations conducted between 1993 and 
2004 using the NR AO VLA in its B-configuration operating at a frequency of 1.4 GHz 



( iBecker et al 



19951 ). Roughly 65,000 three-minute snapshot images were obtained over 
this period. All of the work published to date, including the FIRST catalog, has been 
based on ir nages constructed b y co-adding these snapshots with appropriate weightings (for 



details, see 



White et al. 



19971 ): this process yielded images with a uniform noise level of 
~ 0.15 mJy. The final catalog contains 816,331 sources with a source detection threshold 
of ~ 1 mJy, yielding a source density of ~ 90 deg~^. The astrometric accuracy of the 
cataloged sources is better than 1". The co-added maps have 1'.'8 pixels and an angular 
resolution of 5'.'4. The FWHM of the VLA's primary beam is ~ 30'. The catalog also 
assigns a sidelobe probability to each source. It must be noted that, for this study, we 
have made use of "08jull6" version of the catalog available on the FIRST survey's webpage 



(http://sundog.stsci.edu/first/catalogs/readme.html). 



Becker et al. 



( 119951 ) describe the survey design and objectives in full detail. Here, 
we provide a brief review of the aspects of the survey relevant to a variability search. In 
order to obtain uniform sensitivity and sky coverage in the most efficient manner possible, 
a hexagonal grid is optimal. The pointing centers of each snapshot were placed at the 
vertices of this hexagonal grid, the shape of which changes slowly with declination. The 
typical spacing between two neighboring snapshots is < 26'; thus, all adjacent snapshots 
overlap. The time between overlapping observations varies: ~ 3 minutes between adjacent 
snapshots along the same declination strip in the east-west direction, ~ 1 to many days 



between adjacent snapshots that he on two adjacent strips of dechnation in the north-south 
direction, and ~ 1 to a few years between two adjacent blocks of annual allocations. Thus, 
while the survey was not undertaken to find variables and transients, its design is implicitly 
conducive for a search for radio variability on timescales of minutes to years. 

In this project, we use all of the snapshots from the North Galactic Cap data which 
covers 8,444 deg^. Part of the South Galactic Cap coverage was repeated, and is in the 
process of b eing observed a t a thi rd epoch; preliminary variability results have already been 



reported by ide Vries et al. 



(J2OO4J ) and a comprehensive analysis of this data set will be 



undertaken separately when the current observations are completed. 

In addition, we have restricted our analysis to isolated point sources. From light 
travel-time arguments, extragalactic sources that vary significantly on timescales of minutes 
to years are necessarily compact. Even though compact components of extended sources 
could vary, we limit our attention herein to point-like (unresolved) sources with a sidelobe 
probability below 0.15. Further, an additional selection criterion was imposed requiring 
these sources to be relatively isolated, with no neighbors within 18"; this minimizes the 
likelihood that differing multi-component fits between observations will produce spurious 
evidence of variability. 

Figure [1] shows the fiux density histograms of all sources, point-like sources, and 
isolated point-like sources. Out of a total source count of 758,942 in the Northern Galactic 
Cap, there are 647,550 sources with sidelobe probabilities less than or equal to 0.15. Of 
these, 519,537 are isolated sources and 279,407 meet our criterion for being point-like (major 
axis < 5'.'97 for 6 < 4° declination and major axis < 6'.'87 for 6 > 4° declination). We 
examine the variability characteristics of these 279,407 sources which have been observed 
between two and seven times each. 
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2.2. VLA Data-recording Glitch 



It recently became clear that the on-line VLA recording system suffers from a rare 
glitch that manifested itself many times in the FIRST snapshot database. Although the 
exact nature and cause for this glitch have not been completely understood, the result is 



that a few visibilities from one sn apshot can 



issue has also been pointed out by 



3 e app ended to the adjacent snapshot. This 



OfeketaL 



(120 10[ ). The prototypical example that led to 



the discovery of this glitch is the source claimed to be a transient based on a com parison 



with the NVSS: VLA 172059.9+38522 flGal-Yam et al 



2006 



Levinson et al. 



2OO2I). The 



authors speculated that this could be a prototype for a new class of transients. Upon 
investigation, it was discovered that one of the snapshot images containing the location of 
this source had been affected by the VLA glitch. A thorough search of the database found 
that 190 snapshot images (0.29%) are apparently affected by this problem; while in many 
cases the problem can be fixed by deleting a few records from the uv data set, in other cases 
subtler anomalies appear. As a consequence, we have simply excluded all these snapshot 
images from our analysis. Additional grid images excised include those found to contain 
imaging artifacts and those with large calibration errors (see below). 

3. Data Analysis 



Using the positions of isolated, point-like sources from the FIRST catalog, we created 
a comprehensive inventory of each object location and the corresponding pixel locations in 
each of the grid images in which it appeared. Every object was found to be covered by from 
two to seven snapshot images as expected from the survey design described above. 
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3.1. Gaussian Fitting 

In each of the snapshot images, at the location of the object, a peak flux density was 
estimated by fitting a Gaussian plus a fiat baseline componenlo. The position, width and 
position angle of the Gaussian, already available from the catalog, were held fixed. The 
RMS noise in each image was also measured around the object's location. For every pixel in 
each image, the "local" standard deviation was calculated by considering a box four times 
the FWHM of the synthesized beam centered on that pixel (the FWHM is three pixels). 
Assuming an object of interest is located at a given pixel, we consider a circle consisting of 
pixels at radius 5 times the FWHM of the synthesized beam. Thus, each of the pixels in 
this circle has a standard deviation that was effectively calculated using pixels in an annulus 
that has roughly an inner radius of 3 times and an outer radius of 7 times the FWHM of 
the synthesized beam. We associate the median of all these standard deviations with the 
rms noise around the object. The advantage of using a median is it effectively ignores pixel 
intensities that are not purely noise such as those from another source (or a sidelobe) near 
to the object of interest. In the case of a marginal (or non-) detection of the object in a 
snapshot image, a 3o" upper limit was assigned. 

3.2. Identification & Removal of Striped Images 

Artefacts in radio interferometric images arise for a number of reasons such as 
calibration errors, interference, etc. For a large-scale survey such as FIRST, it is difficult 
to ensure high data quality at all times during the observations. One not uncommon 



^For the purposes of this paper, it should be noted that our use of the term "baseline" 
refers to a fiat or constant component in the Gaussian fitting and should not be confused 
with the baseline vectors (in units of wavelengths) often used in radio interferometry 
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problem is a striped pattern corrupting an image owing to un-excised interference or other 
deconvolution problems. 

After deriving a preliminary list of candidate variables, we plotted their positions on 
the sky, and found that some objects were spatially clustered. These clusters had different 
shapes, including linear, elliptical and irregular appearances, and their sizes spanned several 
degrees on the sky. Such strong clustering is highly suggestive of systematic problems with 
the images. We investigated the snapshot images associated with these clustered objects 
and found that many, especially toward the southern limit of the survey, have clearly visible 
striped patterns running across them. The width of these stripes is typically greater than 
the angular resolution of the images. Having restricted ourselves to isolated point-like 
sources, we were able to make use of the baseline fitting described in §3.11 We flagged any 
image that contained one or more sources with a baseline component greater than 1.75 
times the local image noise. A total of 2162 (3.96%) such snapshot images were deleted 
from subsequent analysis. Figure |2] illustrates how this step significantly eliminated the 
non-physical clustering of variable and transient source candidates. 

3.3. Identifying Global Amplitude Calibration Errors 

While amplitude and phase calibration errors are minimized in the FIRST imaging 
pipeline for individual snapshots, it is important to keep in mind that the survey is spread 



over very large scales in space an d time. The overa. 



over the course of the survey (see lWhite et al.lll997l ). Relevant to our study is the variation 



1 calibration scale, then, could well vary 



of the overall amplitude calibration scale over the tens of thousands of snapshot images. 

An estimation of the calibration error was performed after a first iteration of steps 
in §3.41 but before the identification of any variables or transients. The algorithm used 



-12- 



considers sources in each snapshot image and compares their behaviour to that in all the 
neighbouring snapshots in which they are also found. The null hypothesis, true for a vast 
majority of the sources, is that sources will exhibit a constant peak flux density across the 
different snapshots in which they are detected. A best-fit algorithm assigns a multiplicative 
amplitude calibration correction factor to each snapshot image such that, when it is applied, 
all the sources in the neighborhood are as close to the null hypothesis behavior as possible 
in a minimum absolute deviation sense. Thus, the amplitude calibration variations are 
estimated locally. 

Figure [3] shows the histogram of the amplitude correction factors, where a value of 
unity indicates no correction need be applied. It can be seen that the distribution, especially 
the inner core comprising the large majority of snapshot images, can be reasonably fit with 
a gaussian profile with a standard deviation of 4.3%. However, tails can be seen on both 
sides of the distribution indicating some large fluctuations in the amplitude calibration for 
a small fraction of the images. We have accounted for the typical calibration uncertainty 
using a conservative value of 5% in determining the flux density uncertainty for individual 
snapshots as described in §3.51 



We experimented with different thresholds for the largest allowable amplitude 
calibration offsets. We set the threshold at 20%; for amplitude offsets below this 
value, the number of candidate variables eliminated was proportional to the number 
of images eliminated, whereas snapshots with higher amplitude corrections contained 
disproportionally larger numbers of candidates. We also performed an extra iteration, 
computing the amplitude correction factor after removing sources that were found to be 
candidate variables and recomputing the calibration correction factors. We find evidence 
that the change in the amplitude correction factors between the two iterations is greater for 
snapshots that contain putative variables than for ones that do not (figure H]). However, the 
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final list of variable candidates between one such iteration and the next changes by only 
~ 2%. Thus, we have used only snapshot images that have an amplitude offset less than 
20% after one iteration of the amplitude correction factor estimation. 

For the set of images retained, we apply, on an image by image basis, the overall 
amplitude calibration correction factors to the peak flux densities, local image noise values, 
and baseline components for all the sources found in those images. Hereafter, these 
corrected values are treated as the true values. 

3.4. Determining the Primary Beam Empirically 

The data were obtained in 2x7 3-MHz channels near 1.4 GHz. While the center 
frequency of these bands was fixed for most of the survey observations, the interference 
environment was constantly changing, leading to data editing which affects the effective 
center frequency of the observing band(s). Thus, the frequency-dependent primary beam, 
normally derived from the AIPS task PBCOR, cannot be adopted blindly. 

Our selection algorithm makes use of the hypothesis that the non-varying source 
population is expected to dominate the variable one. Hence, with the off-axis angle 
information available for each source in each snapshot image, the primary beam can be 
determined empirically. We see from figure |5] that the difference between our estimate for 
the primary beam and the nominal VLA primary beam is at most a few percent. As figure E] 
demonstrates, the uncertainty arising from the primary beam estimation is a negligible 
contribution to the overall error budget, independent of source flux density. 

The peak flux density of each isolated, point-like object measured in each snapshot 
image (as described in §3.ip is normalized by th e reported catalog flux density (after taking 



into account the CLEAN bias as quantified by 



White et al. 



20071 ). Since the snapshot 
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images and the fitted peak flux densities derived therefrom have not been corrected for the 
primary beam attenuation, adopting the null hypothesis that a source's flux density stays 
constant means that these normalized values yield a primary beam normalization at the 
off-axis angle of the measured source. When this normalization is performed for all objects, 
we obtain an empirical primary beam: 

measured map peak flux density (mJy) 
empirical PB 



catalog peak flux density (mJy) — CB (mJy) 



where CB is the CLEAN bias flwhite et allbnOTl ). 



The empirical primary beam values obtained from all the objects in each of their 
corresponding snapshot images were binned in angular offset from the center of the primary 
beam. A sixth-degree polynomial was fitted to determine an analytical expression for the 
empirical primary beam. The error in the primary beam estimate is the uncertainty in the 
mean estimate for the primary beam in each bin, which is the standard deviation in each 
bin divided by the square root of the number of sources in that bin. Thus, a primary beam 
error estimate (used in §3.5p is assigned to each data point on each light curve depending 
on which bin of angular offset the data point falls into. 

Figure [5] shows the scatter plot of normalized flux densities from all the usable snapshot 
images in the upper panel; in the lower panel, the bin-averaged data points from the upper 
panel are overp lotted on the polynomial fit. Fits were obtained independently in eight 
separate declination zones, but no significant variations (> 0.9%) with declination were 
seen, so a single empirical primary beam was used for the whole survey. 
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3.5. Sources of Uncertainty in the Peak Flux Density 

We next estimate the sources of uncertainty in tlie peak flux density obtained for an 
object detected in a snapsliot image. li F = Af/p, where F is the reported flux density, A 
is the absolute amplitude cahbration (ideally A = 1) and p = p{6), the primary beam value 
at an offset (6) from the fleld center, the principal uncertainties that need to be accounted 
for include the following: 

1. local noise in the grid image in the vicinity of the source (A/): This will further be 
amplifled because of the primary beam effect, especially for sources far from the fleld 
center. AF = A Af/p. 

2. uncertainties in primary beam (Ap): We have computed empirically the primary 
beam and its uncertainty from the data as described in §3.41 AF = A f /p^ Ap. 

3. uncertainties in the overall flux calibration amplitude {A A): This uncertainty is ~ 5% 
and was estimated as described in §3.31 AF = AA f /p. 

4. pointing errors of the VLA antenna (A^): This is not the same as the positional 
accuracy of the sources in the catalog. The antennas have an rms pointing error of 
A6 ~ 10". This translates to an error in flux density, AF = A f (dp/dO) A6 /p^. 

The overall error is obtained by adding all the above errors in quadrature. The 
dominant contributions come from the map noise for fainter sources and from the amplitude 
calibration errors for brighter sources. 



AF = ^/ {AA f/pf + {A Af/pf + {A f/p' Apf + U f/p' ^ A^' 

Figure [6] shows the contributions of these different sources of uncertainty as a function 
of angular offset from the center of the image for two sources observed (without primary 
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beam correction) to have flux densities of 1 mJy and 10 mJy, respectively. 

Three other potential sources of error have been assessed and found to be negligible 
compared to the above terms. Firstly, we adopt for each source the FIRST catalog position 
and hold it fixed when fitting for a source's peak flux density in each grid image. Any 
systematic astrometric errors which are a function of position in the primary beam field 
of view will thus induce an underestimate of source flu x density. We ha v e asse ssed such 



McMahon et al 



fl2002[ ). We have 



errors and report the results for the co-added images in 

also assessed the errors for the grid images and find similar results: the maximum error is 

<0'.'3. This will induce a a maximum flux density error of < 1%. 

A second potential effect involves the (generally unknown) source spectral index. 
Since the primary beam shape is a function of observing frequency, the primary beam 
correction changes from the lower to the upper end of the observing band. While our 
empirical determination of the primary beam correction assures that the average value is 
correct for the average source spectral index, a source with an extremely steep or inverted 
spectrum will have an inappropriate primary beam correction applied. We have evaluated 
the magnitude of this effect for extreme sources with spectral indices of +2.0 and —1.5 
compared to a mean source spectral index of ~ —0.7. At an off- axis angle of 20'the error is 
less than 1% in all cases. At 30'off axis, it is < 2% for —1.5 < a < +1.0; for an extremely 
inverted (and very rare) source with a = +2.0 the error does reach 4%, but the other errors 
rise rapidly this far off-axis and still dominate the total error budget. 

Finally, we address the matter as to whether or not variations in the small CLEAN 
bias added to each measured flux density could generate spurious variability. CLEAN bias 
is an at-best partially unders tood manifestation of the non-linear C LEANing algorithm 



applied to snapshot data (see 



Becker et al 



1995; 



Condon et al. 



19981 ). It is known to vary 



with source extent and map rms; in principle, it could also vary with time. For the FIRST 
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catalog, artificial source experiments (inserting sources of known fiux density into a uv data 
set and then measuring the fiux density recovered in the derived image) suggested that, for 
the point-hke sources considered here, a constant value of 0.25 mJy provided a satisfactory 
correction. Becker et al. (1995) did note that for fields with sources brighter than 500 mJy 
(and thus higher rms values), the bias correction could be as great as 0.50 mJy leading to 
a 0.25 mJy underestimate of the true fiux density. All variable candidates in such fields 
discovered here are fiagged. 



Subsequently, we examined this issue in 



White et al. 



(120071 ) using artificial sources as 



well as an ana^ 



( ICondon et al. 



ysisof sources from the deep radio imaging of the Spitzer First-Look Survey 



20031 ). For 145 sources from different fields in the fiux-density range 1 to 
3 mJy, the CLEAN bias in FIRST images was shown to be 0.25 ± 0.04 mJy. Thus, even for 
1 mJy on-axis sources, the size of any CLEAN-bias error is less than that of image noise 
and the amplitude correction error; for brighter, and/or off-axis sources, the uncertainty is 
negligible. 



Selection of Variable Candidates 



4.1. Mean Inferred Peak vs. Catalog Peak Flux Density 



Owing to the reasons discussed in §2.2[ §3.21 and §3.3[ the snapshot images that form 
the basis of our search are a subset of those from which the FIRST catalog was constructed 
The catalog peak flux densities were derived by a weighted average estimate of the inferred 



fluxe s where the weights a re related to the square of the primary beam pattern ( IWhite et al. 



1997 



Condon et al. 



19981 ). We have recalculated the mean value for each source using 



only those snapshot images actually included in our search and denote it by /, where 
/ = YliiVlfi/ YliiVl- f and (/) will be used interchangeably hereafter. 
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Inferred peak flux density points that lie far from / are considered outliers. We have 
selected as candidate variables outliers that deviate from / by more than five times their 
inferred peak flux density uncertainty, defined in §3.5[ 

4.2. Removal of Contamination from False Positives 

We have identified several additional effects other than genuine variability that can 
generate outliers. 

• Low-Quality Images: Noisy images and images with interference and other 
deconvolution errors are significant sources of false detections. Such outliers have 
been largely eliminated using the technique described in §3.2( a few additional bad 
fields containing large numbers of spurious sources were also deleted. 

• Sidelobes: There were instances in which sidelobes from nearby sources have 
significant flux density in one grid image but are absent or marginal in others, thus 
producing an apparent source that looks both genuine and variable. The FIRST 
catalog-generation algorithm does calculate a sidelobe probabihty for each source, 
and we selected only objects with a sidelobe probability less than 0.15. Nonetheless, 
some unlabeled sidelobes are still present. All fields containing bright sources were 
examined carefully to eliminate sidelobes as spurious variables. Variable and transient 
sources suspected of being sidelobes from strong sources with peak flux densities of 
> 500 mJy within a radius of 31' have been flagged. 

• Calibration Problems: Defective calibration can cause sources to vary between 
different snapshot images even after incorporating the grid image amplitude correction 
factors. To identify and eliminate such sources, we used the catalog to select a few 
(usually between two and four) objects that are neighbors of each outlier (within a 



-19- 

9' radius) and are also point-like. These were normalized by their respective mean 
peak flux densities. Each was examined for any deviations from the ideal normalized 
value of unity. Some neighbors exhibited the same variability pattern as the outliers, 
thus conflrming the presence of remaining calibration problems. Some others were 
simply inconsistent with no variation, making the corresponding outliers suspect. We 
used this procedure on all outliers to select a final list that appeared to be free from 
calibration problems. About 30 outliers have neighbors with a deviation from the 
mean of 3a (~ 5 would be expected from a normal distribution), and 17 have a value 
> 3.5cr (figure [7]); the table includes the deviation for the neighboring sources in sigma 
for each source. 

We also checked to see if there is any systematic variation of the fraction of variables 
and transients with off-axis angle. We plotted the relative frequencies of the off-axis angle 
for the source in the light curve containing the data point most discrepant from the mean 
peak flux density. Simulataneously, we plotted the same quantities for all the isolated 
point-like sources in FIRST. In flgure [HI we flnd no major systematic differences between 
the outliers and the rest of the isolated point-like sources in FIRST. The slight shift of the 
distribution of variables toward the fleld center arises from the fact that the low observed 
flux densities for sources far off axis increases their uncertainty, and thus reduces our ability 
to detect variability. 

After removal of all identifled false positives, we are left with a sample of 1627 variables 
and transients. 

4.3. Indicators of Variability 

We used three indicators to select variable and transient candidates. 
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4-3.1. x^-P^ohahility 



X^-statistic falls above a certain value x^ = Xli ( "^""^ ) ; with /j = inferred peak flux density 
of the r data point, / = average inferred peak flux density weighted by the primary beam, 
(Tj = uncertainty in /j, z/ = n — 1 is the number of degrees of freedom, and n, is the number 
of data points in the light curve. Since there is only one constraint in the model for the null 
hypothesis - that the peak flux density is constant with time - it follows that v = n — 1. 

The probability threshold chosen is P^x^^v) < 5.733 x 10^^ which is equivalent to 
the probability that the absolute value of a normally distributed variable falls beyond five 
standard deviations. 

4-3.2. Maximum Deviation from, the Mean Inferred Peak Flux Density 

An alternate indicator of variability is the maximum deviation of any individual data 
point in a light curve from the null hypothesis that the flux density is constant at a 
value given by the mean inferred peak flux density. We denote this by (Xmax defined as 
MAX I ^^ |. We chose a threshold amax > 5. 

4.3.3. Data Point Pair with Most Significant Flux Density Difference & Timescale 

Information 

In order to preserve information about variations between any pair of data points in 
a light curve (and the associated timescale), we also estimate the variation of each point 
from all the others in a light curve. The significance of this difference is given by the ratio 
obtained by dividing the difference by the uncertainties in the two flux density estimates 
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added in quadrature, 

I/. - fj 



A^, 



af + a] 



where, A^ = significance ratio between i and j data points, fi = inferred peak flux 
density at the i data point, fj = inferred peak flux density at the j data point, 
(jj = uncertainty in fi, and, aj = uncertainty in fj. We select the pair that has the 
maximum absolute value defined as Amax = MAX {Ay}. For an outlier selected with this 
indicator, the threshold is Amax > 6. 

P{x^, v) is an indicator of overall variability of a light curve that can arise from low 
but sustained variations, large, sudden variations, or both. Although P(x^, J^) provides 
a valid statistical indicator of variability, information about timescales and the temporal 
localization of the variability has been ignored. Unlike the indicator P(x^, v), crmax and 
Amax are indicators of large sudden variability and temporally localize the variability in 
the light curve. 

A similar estimation of P{x^, v), crmax and Amax are performed for the neighbours of 
the outliers. The respective thresholds were chosen as those beyond which the population 
of varying sources and their non-varying neighbours look statistically different as defined by 
a K-S test. 

We found that the variability measure given by Amax > 6 identified no new 
variables that were not already found by one of the other two measures, namely, 
P{x^i ^) < 5.733 X 10"''' and crmax > 5. However, we include this variability measure 
in subsequent discussions because it represents a useful quantity in describing the most 
significant flux density difference between data points in a light curve and provides us with 
an idea of the timescale of variability. 

Figure [H] shows the distribution of crmax against /. Note that a few objects have a 
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relatively low significance as per the measure amax, but are still included as variable when 
above the threshold of the other indicator of variability P{x^i ^)- Figure [TUl illustrates the 
distribution of outliers described by the parameter P(x^; ^)- The fraction of outliers to the 
right of the dashed line are those selected by the criterion amax > 5. The number with a 
left arrow indicates the number of outliers with even smaller probabilities that could not be 
represented in the plot. 

The sky positions of the outliers are shown in figure [2l One notable feature is that the 
density of outliers below a declination of ~ 4°is enhanced. This is because the synthesized 
beam with which the data is convolved increases below a declination of ~ 4°as is clearly 
illustrated in figure [11] Consequently, more objects meet the criterion for a point-like 
source, increasing the density of both non-variable and variable objects. 

5. A Catalog of Highly Variable Radio Sources 

Table [1] presents a sample page from the list of 1627 highly variable radio sources 
identified in our study. The complete table is available online in electronic format. Column 
1 provides the source position derived from the FIRST catalog. Columns 2 and 3 give, 
respectively, the FIRST catalog peak flux density and the mean peak flux density derived 
from the light curves. The range of flux densities for the source derived from the grid 
images are in column 4, while column 5 provides the NVSS catalog peak flux density. Note 
that, while FIRST and NVSS have very different angular resolutions, our restriction of this 
study to isolated point sources in the higher-resolution FIRST survey means resolution 
effects should not be a factor in most cases. 

These data are followed by the number of observations available (col. 6). The next 
three columns provide the three measures of variability: crp(^2^) for the light curve (col. 
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7), the maximuni deviation of a single data point from the mean flux density in the light 
curve (col. 8), and the difference (in a) between the most significantly different pair of data 
points (col. 9). crp(^2 j^) is the number of standard deviations on both sides of a normal 
distribution beyond which the probability from this normal distribution matches Pix^, v)- 
The maximum-to-minimum flux density ratio is provided in column 10. The T^^j^ (days) in 
column 11 denotes the minimum timescale at which two data points differ by at least 6cr. 
In cases where the maximum absolute difference of any pair of data points never reaches 
a value 6cr, the value denotes the timescale at which the maximum absolute difference in 
column 10 is attained. In a few cases where the date of observation cannot be reliably 
obtained from the snapshot header, the entry is left without a numerical value. Column 
12 contains a flag 'N' if the outlier is found to be in the vicinity of a source brighter than 
500 mJy within 31'. Column 13 provides the distance by which the mean normalized peak 
flux density of the outlier's neighbors departs from the mean of the distribution of such 
normalized neighbor peak flux densities from all outliers, and is a proxy for the quality of 
the fleld. The higher the absolute value in this column, the less ideal the quality of the fleld 
surrounding the outlier and hence, the less reliable the physical variability of the outlier. 

The type of light curve is denoted in column 14 by V (Variable) and T (Transient). 
The apparent SDSS-i band magnitudes are provided in column 15 where a SDSS match is 
available. Finally, column 16 gives the best ranked counterpart at other wavelengths. This 
ranking is explained in ^ 

Figure [T2] shows the distribution of the ratio of maximum to minimum flux density for 
our 1627 variables which range s from 25% to a fact or of > 20; overplotted is this ratio for 



the sample of variable sources ( Ide Vries et al.ll2004j ) with the same flux density distribution 
as ours. The areal density of their variables is ~ 1 deg~^ while we flnd it to be ~ 0.2 deg~^ 
for our sample. This difference is primarily due to the lower threshold of detection (4cr) 
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used by Ide Vries et al.l (l2004f ). Sources for which lower hmits are used are indicated by 
shading. The relation between maximum-to-mininiuni flux density ratio and mean peak 
flux density is shown as a scatter plot in figure [T3i 

6. Identification of Variable Radio Sources 

Table [2] presents the results of cross-matching our catalog of variable sources with a 
variety of catalogs and databases. The first column lists the source type, followed by the 
specific catalogs used for cross identification and the matching radius adopted for each (col. 
3). The rank in column 4 indicates the order in which the matches were applied; e.g., we 
first sought matches with bright star catalogs, then with a pulsar catalog, then with quasar 
catalogs, etc. This is important in interpreting the number of matches A^ found in column 
5 where we list both the number of total matches including those previously identified in a 
higher-ranked catalog and, in parentheses, the number of unique matches to the specified 
catalog. For example, the photometric SDSS quasar catalog has 68 total matches, but only 
67 of them are real; the 68 is also matched to a pulsar (which is the true identification). 

Column 6 gives the match number as a percentage of all variable sources (e.g., a total of 
7.4% of all variables are SDSS quasars). Columns 7 and 8, respectively, give the percentage 
of all FIRST sources matching each catalog that are identified as variable (e.g., 76% of all 
FIRST pulsars are variable, but only 1.2% of SDSS spectroscopically identified quasars 
with F/i?5T counterparts are highly variable), and the percentage of all F/i?S'T isolated 
point sources in the catalog that find identifications in the given catalog. Parentheses are 
used as in column 4. 

In all cases, the rate of chance coincidences is low, typically less than one source per 
category with the exception of SDSS galaxies where up to seven chance coincidences (out 
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of 442 matches) are possible. In summary, we find 5 bright stars, 13 radio pulsars, 120 
quasars, and 489 galaxies among our variable sources, yielding firm identifications for 38% 
of our objects. An additional 118 objects are identified with unclassified stellar objects in 
the SDSS and GSC II databases; a total of 50 objects have X-ray counterparts, although 
only 5 of these are not also identified optically (mostly as quasars). This leaves 877 objects 
or 54% of the total without identifications in other wavelength bands (although 499 of these 
are found in other radio catalogs). We discuss below each class of objects in turn, and 
compare the properties of the identified objects to the unidentified ones in order to seek 
clues as to the identities of the latter sources. 

In figure [HI we show how the maximum-to-minimum flux density ratio differs for 
the five different classes of variable objects. The pulsars are, as expected the most highly 
variable, while the sources classified as galaxies have slightly higher variability amplitudes 
than the quasars. The objects with no counterparts closely mimic the distribution of 
galaxies. 

Figure dS] illustrates the distribution of mean peak flux densities for the various source 
classes. As expected, the relative fraction of QSOs with respect to galaxies increases with 
increasing mean peak flux density. The pulsars and the stars are much fainter with mean 
peak flux densities of a few mJy. Again, the distribution of unidentified objects appears 
to follow the trend shown by galaxies and interestingly, it also has a significant tail at the 
high-mean peak flux density end. 

6.1. Radio Pulsars 

While ~ 70 known radio pulsars fall within our survey area, most are both faint 
and have steep radio spectra, leading to only 17 detections in the FIRST survey catalog 
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(Table[3]). Of these, 9 (~ 50%) appear in our catalog of highly variable sources. By slightly 
relaxing our criterion for a point-source, we find an additional four pulsars that would have 
been classified as variables by our algorithm; we add these to the total number of outliers 
quoted throughout the paper. Thus, more than three-quarters of the detected radio pulsars 
in the survey are found to be highly variable. 

While the intrinsic radio luminosity of a pulsar is steady, its emission suffers from 
interstellar scintillation which arises when signals traveling in slightly different directions 
are alternately scattered into, and out of, our line of sight by the fluctuating electron density 
distribution of the interstellar medium. The effect is largest when the number of scattering 
centers is small, so the highest variability is seen for the nearest pulsars; indeed, 11 of our 
13 variable objects have dispersion measures (DM) of < 20 cm~'^pc; the other two have 
DM values of 27 cm'^^pc and 41 cm~'^pc. Of the four pulsars not seen to vary significantly, 
two have flux densities < 1.5 mJy, too close to our threshold to detect variability, and a 
third has a DM > 35. 

The typical time scale for scintillation is ~ 1 min at 20 cm (to be compared with 
our 165 s integration time) and the decorrelation bandwidth is typically smaller than our 
50 MHz bandwidth, so our observational parameters largely smooth over the fluctuations. 
Nonetheless, pulsars are among the most variable objects in our sample, with 8 out of 13 
varying by more than a factor of 4 (see figure [T2|) . Interestingly, 17 of the 877 variables 
which lack optical counterparts entirely also vary by this large factor; this is to be contrasted 
to the fact that none of the 120 sources identified as quasars are this variable, and only 
1% of the sources coincident with galaxies vary by such large factors. All these 17 sources 
have flux densities less than 4 mJy and most appear only once or twice, falling below our 
detection threshold in the other observations. An examination of the radio spectrum and 
polarization of these sources could provide candidates for pole-on, very short period, and/or 
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intermittent pulsars worthy of further study. 

6.2. Optical Counterparts 

A total of 732 radio variables have optical counterparts in one or more of the catalogs 
used in the identification program summarized in table 121 the vast majority of these are 
from a match to the SDSS DR7 catalog. Figures [T6] shows SDSS ^-band magnitudes for the 
various source classes, while figure [T7] displays SDSS color-color diagrams for the identified 
galaxies and quasars of our sample. We discuss each sample of objects in turn below. 

6.2.1. Stars 

Most stars are very faint at radio wavelengths. Of the more than 800,000 sources in 
the FIRST catalog, only 37 match objects in the bright star catalogs we examined to within 
1'.'4. To check that the potentially high proper motions of these (mostly) nearby stars 
have not obviated any true matches, we expanded the search radius to 5 ^^ no additiona l 



Helfand et al 



match es were found when proper-motion-corrected positions were used. 
( 119991 ) conducted a more exhaustive search of the first 4,760 deg^ of the FIRST survey and 
found a total of 26 radio-detected stars, 5 of which were at flux densities below the survey 
limit of 1.0 mJy. One of the detections reported in that earlier survey, the flare star (08 08 
55.47 +32 49 06.0), is found in our catalog of highly variable objects. It has a light curve 
showing variability on timescales of days, months and years. In total, we find only 5 stars 
both bright enough and variable enough to make our sample of variables and transients. 
All are previously known variables. 
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6.2.2. Quasars 



A total of 53 quasars from the SDSS DR5 spectroscopic quasar catalog (jSchneider et al. 



20071 ) are found in our list of variables. In addition , another 67 objects in the DR6 



photometric quasar catalog of 



Richards et al. 



( 120091 ) are in our list, making a total of 



120 quasars in the sample or 7% of all variables. For the whole FIRST catalog there are 
8396 and 10,025 matches from the spectroscopic and photometric catalogs, respectively, 
accounting for nearly 2.5% of all FIRST sources. Tabled however, includes only matches 
with isolated point-like sources from the FIRST catalog. 

The quasars have the highest mean radio flux density (> 10 mJy) among our identified 
source classes (figure [T^ and the lowest mean max-to-min flux density ratio. This is in 
part a selection effect, in that only in bright sources does modest variability exceed our 
significance threshold. Nonetheless, with our coarse and uneven sampling, we have found 
only a dozen quasars that vary by more than a factor of 2.5. 

6.2.3. Galaxies 

The largest identified segment of our sample is coincident with objects classified as 
galaxies in SDSS (with a few additional galaxies contributed from GSC II, 2MASS, NED, 
etc.). The 489 galaxies comprise roughly 30% of our sample. The galaxies have a broad 
spread in magnitudes (figure [T6|) and a lower mean radio flux density than the quasars but 
a somewhat higher amplitude of variability (Figure [T^ . 

Since our matching radius is only 1'.'4, the radio sources are all coincident, within the 
uncertainties, with the galaxy nuclei. AGN are thus undoubtedly the source of variable 
radio emission in virtually all of these objects. Optical spectroscopy (and/or hard X-ray 
imaging) of this sample could be interesting, in that it could reveal the fraction of buried 
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AGN which show no optical evidence of an accreting black hole. 

We examined by eye SDSS images for all 1627 variable source candidates. Virtually 
all of the matched sources had clear counterparts and almost all of the unmatched sources 
were blank fields as expected. However, there were a handful of cases where there was a 
coincident optical object that did not appear in the SDSS catalog; these are noted in the 
table. In addition, there were several cases in which the radio source was superposed on a 
large (>10") galaxy and not coincident with the galaxy nucleus. In a few cases, a backgound 
source was visible through the galaxy light and is the likely radio source counterpart. In 
several other cases, however, no background source is apparent although it could, of course, 
simply be obscured by the foreground galaxy's light or dust. 

6.2.4- Unclassified Stellar Counterparts 

A total of 114 radio variables are coincident with unclassified stellar objects in the 
SDSS DR7 catalog; a few additional such objects are found in the GSC II catalog in areas 
lying outside of the SDSS coverage. Figure [18] shows the SDSS color-color diagrams for 
these stellar objects; comparison with figure fT71 shows a large majority of these objects have 
colors consistent with quasars, while virtually all of the remainder fall within the galaxy 
contours in color-color space. The majority of these objects fall below the mj = 21.0 cutoff 
for the SDSS photometric quasar catalog, but nearly 60 objects lie above this threshold. 
Spectroscopic follow-up of these objects could provide insight into the completeness of the 
photometric quasar catalog^. 



^The three brightest objects with rrij < 18 include one source with no spectrum and two 
with spectra classified as 'STAR'. 
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6.2.5. Unidentified Sources 

There are no optical (or X-ray) counterparts for 877 sources comprising 54% of our 
sample. Figures [12] and [17] show that these sources mostly cohabit the parameter-space 
of galaxies and QSOs, although there are a few notable outliers with very high variability 
amplitudes as noted above. 

7. Summary 

After analyzing ~ 55, 000 snapshot images from the FIRST survey of the radio sky 
covering 8444 deg^ with sensitivity down to 1 mJy for variability and transient phenomena, 
we have assembled a sample of 1627 sources that are significantly variable. This sample 
was matched with multi-wavelength catalogs to identify counterparts. We found 13 radio 
pulsars, 53 SDSS spectroscopic QSOs, 67 SDSS photometric QSOs, 489 galaxies, 5 stars, 
123 optically detected but unclassified sources and 877 objects lacking optical counterparts. 
The unidentified sources mostly occupy the parameter-space of galaxies and QSOs but 
there are a few notable outliers. Follow-up observations of several of these source classes 
would likely be fruitful. 

Our study has shown that there is much to be discovered in the dynamic radio sky. 
Exploration by the next-generation instruments sampling different portions of the spectral 
and temporal domains will prove to be highly productive. 
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comments on this paper. RHB's work was supported in part under the auspices of the 
US Department of Energy by Lawrence Livermore National Laboratory under contract 
W-7405-ENG-48. 



-31- 

The National Radio Astronomy Observatory is a facility of the National Science 
Foundation operated under cooperative agreement by Associated Universities, Inc. This 
research has made use of the NASA/IPAC Extragalactic Database (NED) which is operated 
by the Jet Propulsion Laboratory, California Institute of Technology, under contract with 
the National Aeronautics and Space Administration. 

Funding for the SDSS and SDSS-II has been provided by the Alfred P. Sloan 
Foundation, the Participating Institutions, the National Science Foundation, the U.S. 
Department of Energy, the National Aeronautics and Space Administration, the Japanese 
Monbukagakusho, the Max Planck Society, and the Higher Education Funding Council for 



England. The SDSS Web Site is |http : //www . sdss . org/ 



The SDSS is managed by the Astrophysical Research Consortium for the Participating 
Institutions. The Participating Institutions are the American Museum of Natural History, 
Astrophysical Institute Potsdam, University of Basel, University of Cambridge, Case 
Western Reserve University, University of Chicago, Drexel University, Fermilab, the 
Institute for Advanced Study, the Japan Participation Group, Johns Hopkins University, 
the Joint Institute for Nuclear Astrophysics, the Kavli Institute for Particle Astrophysics 
and Cosmology, the Korean Scientist Group, the Chinese Academy of Sciences (LAMOST), 
Los Alamos National Laboratory, the Max-Planck- Institute for Astronomy (MPIA), the 
Max-Planck- Institute for Astrophysics (MPA), New Mexico State University, Ohio State 
University, University of Pittsburgh, University of Portsmouth, Princeton University, the 
United States Naval Observatory, and the University of Washington. 

This research has made use of data obtained from the High Energy Astrophysics 
Science Archive Research Center (HEASARC), provided by NASA's Goddard Space Flight 
Center. 



-32- 

REFERENCES 

Aller, M. F., Aller, H. D., Hughes, P. A., 2003, ApJ, 586, 33 

Becker, R. H., White, R. L., Helfand, D. J., 1995, ApJ, 450, 559 

Bower, G. C, Saul, D., Bloom, J. S., Bolatto, A., Filippenko, A. V., Foley, R. J., Perley D. 
2007, ApJ, 666, 346 

Carilh, C. L., Ivison, R. J., Frail, D. A., 2003, ApJ, 590, 192 

Condon, J. J., Cotton, W. D., Greisen, E. W., Yin, Q. F., Perley, R. A., Taylor, G. B., 
Broderick, J. J., 1998, AJ, 115, 1693 

Condon, J. J., Cotton, W. D., Yin, Q. F., Shupe, D. L., Storrie-Lombardie, L. J., Helou, G. 
Soifer, B. T., Werner, M. W., 2003, AJ, 125, 2411 

Cordes, J. M., Lazio, T. J. W., McLaughlin, M. A., 2004, NewAR, 48, 1459 

Cordes, J. M, 2008, ASPC, 395, 225 

Croft, S., Tomsick, J. A., Bower, G. C, 2011. arXiv:1107.5039C 

de Vries, W. H., Becker, R. H., White, R. L., Helfand, D. J., 2004, AJ, 127, 2565 

Fender, R. et al., 2008. larXiv:0805.4349l 

Gal- Yam, A. et al., 2006, ApJ, 639, 331 

Gregorini, L., Ficarra, A., Padrielh, L., 1986, A&A, 168, 25 

Helfand, D. J., Schnee, S., Becker, R. H., White, R. L., McMahon, R. G., 1999, AJ, 117, 
1568 

Hessels, J. W. T., Stappers, B. W., van Leeuwen, J., 2009, ASPC, 407, 318 



-33- 

Hey, J. S., 1946, Nature, 157, 47 

Hopkins, A., Windhorst, R., Cram, L., Ekers, R., 2000, Exp. Astron., 10, 419 

Hyman, S. D., Lazio, T. J. W., Kassim, N. E., Bartleson, A. L., 2002, AJ, 123, 1497 

Hyman, S. D., Lazio, T. J. W., Kassim, N. E., Ray, P. S., Markwardt, C. B., Yusef-Zadeh, P., 
2005, Nature, 434, 50 

Hyman, S. D., Wijnands, R., Lazio, T. J. W., Pal, S., Starling, R., Kassim, N. E., Ray, P. S., 
2009, ApJ, 696, 280 

Johnston, S. et al., 2007, PASA, 24, 174 

Johnston, S. et al, 2008, Exp. Astron., 22, 151 

Levinson, A., Ofek, E. O, Waxman, E., Gal- Yam, A., 2002, ApJ, 576, 923 

Lister, M. L., Tingay, S. J., Preston, R. A., 2001, ApJ, 554, 964 

Lonsdale, C. J. et al., 2009, lEEEP, 97, 1497 

McLaughlin, M. A. et al., 2006, Nature, 439, 817 

McMahon, R. G., White, R. L., Helfand, D. J., Becker, R. H., 2002, ApJS, 143, 1 

Napier, P. J., 2006, ASPC, 356, 65 

Ofek, E. O., Breslauer, B., Gal- Yam, A., Frail, D., Kasliwal, M. M., Kulkarni, S. R., 
Waxman, E., 2010, ApJ, 711, 517 

Richards, E. A., Fomalont, E. B., Kellermann, K. I., Windhorst, R. A., Partridge, R. B., 
Cowie, L. L., Barger, A. J., 1999, ApJ, 526, 73 

Richards, G. T. et al., 2009, ApJS, 180, 67 



-34- 

Rupen, M. P., 2000, AAS, 196.3002 

Schneider, D. P. et al., 2007, A J, 134, 102 

White, R. L., Becker, R. H., Helfand, D. J., Gregg, M. D., 1997, ApJ, 475, 479 

White, R. L., Helfand, D. J., Becker, R. H., Ghkman, E., de Vries, W., 2007, ApJ, 654, 99 

Wilkinson, P. N., Kellermann, K. I., Ekers, R. D., Cordes, J. M, Lazio, T. J. W., 2004, 
NewAR, 48, 1551 

Windhorst, R. A., Hopkins, A., Richards, E. A., Waddington, I., 1999, ASPC, 193, 55 



This manuscript was prepared with the AAS lATfrjX macros v5.2. 



-35- 








2 3 

log,o(Fpeak/mJy) 



Fig. 1. — Histogram of peak flux densities for sources in the FIRST survey. The hatched 
area includes all point sources, while the black shaded histogram is for the isolated point 
sources used in this study. 
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(a) Before flagging (b) After flagging 

Fig. 2. — The sky positions (J2000) of outliers before (Left) and after (Right) flagging and 
removing the snapshot images that contain one or more sources with a significant baseline 
component from the gaussian fitting routine (See §3.2p . Note the clear reduction in the source 
clusters. The horizontal dashed line indicates the boundary in declination below which the 
outlier density shows enhancement; this is a consequence of the change in the convolving 
beam size which reduces the number of sources eliminated from consideration by modest 
extent (cf. Figure fTTl). 
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Fig. 3. — Histogram of overall amplitude correction factors for grid images. The dashed 
line represents the best gaussian fit to the histogram whose parameters (peak, mean (/i) and 
standard deviation (a)) are displayed in the plot. 
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Fig. 4. — Relative frequencies of the difference in median calibration offsets (in percent) for 
the grid images between the first and second iterations where the second iteration has ex- 
cluded the outliers (determined from the previous iteration) from the calibration calculations. 
The shaded portion represents all snapshot images while the unshaded portion represents 
snapshot images that contain one or more outliers. The unshaded distribution is clearly 
wider than the shaded one as indicated by the gaussian fits, implying that the amplitude 
calibrations of the snapshot images are affected by the inclusion of the outliers. 
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Fig. 5. — (following page) Upper panel: Fitted peak Gaussian flux densities of isolated 
point sources in the snapshot images normalized by their respective catalog flux densities. 
Lower panel: Binned average (filled circles) of the panel above over-plotted with the fitted 
polynomial primary beam model (solid line) and the nominal primary beam correction factor 
found in AlPS ('plus' symbols). 
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Fig. 6. — (following page) Relative contributions of errors from various causes. Upper panel: 
For a source with an observed peak flux density of 1 mJy. Lower panel: For a source with 
an observed peak flux density of 10 mJy. The solid line shows the primary beam-corrected 
intrinsic flux densities of sources as a function of off-axis angle. 
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Fig. 7. — Histogram of mean normalized flux densities of neighbouring sources associated 
with outliers. 
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Fig. 8. — Relative frequencies of the off- axis angles of the most discrepant data point from 
(/). The unshaded distribution corresponds to all the isolated point- like sources in FIRST 
while the shaded distribution corresponds to the variables and transients in the sample. 
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Fig. 9. — The measure of variability, namely, crmax plotted against average peak flux den- 
sity. The different symbols correspond to outliers that were selected on the basis of various 
measures of variability. 
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Fig. 10. — The histogram of logarithmic values of x^-probability, -P(x^5 '^)- The fraction of 
outliers to the right of the dashed line are those selected by criteria other than P(x^5 ^)- The 
number with a left arrow indicates the number of outliers with even smaller probabilities 
that could not be represented in the plot. 
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Fig. 11. — Scatter plot of the sizes of outliers vs. the declination. The horizontal dashed line 
shows the boundaries where the deconvolving beam size changes (cf. Figure [2]). The vertical 
dashed line arises from our definition of point-like - a source with an intrinsic size < 2'.'5. 
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Fig. 12. — Histogram of ratios of maximum to minimum peak flux densitie s of all of our 



de Vries et al. 



variab le objects. The hatched histogram is for objects from the sample of 

( 20041 ). The dark shading indicates the number of lower hmits that fall inside this bin. 



These lower limits were derived using 3cr upper limits for non-detections in the denominator. 
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Fig. 13. — (following page) Scatter plot of the ratios of maximum to minimum peak flux 
densities of all objects against their average peak flux densities. The upward arrows denote 
that the value of the ratio is derived from a minimum peak flux density which was a non- 
detection whose 3cr upper limit was used, correspondingly yielding a lower limit for the ratio. 
The numerator-denominator pair in the ratio comes from the pair of data points in the light 
curve that yields the maximum absolute value of Amax- 
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Fig. 14. — (following page) Histogram of ratios of maximum to minimum peak flux densities 
of different classes of variable objects. The dark shading indicates the number of lower 
limits that fall inside this bin. These lower limits were derived using 3o" upper limits for 
non-detections in the denominator. 
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Fig. 15. — Histogram of FIRST catalog peak flux densities of different classes of variable 
objects. 
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Fig. 16. — Histogram of SDSS z-band magnitudes of different classes of variable objects 
detected in SDSS. 
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Fig. 17. — (following page) SDSS color-color diagrams for variable objects detected and 
classified as galaxies and QSOs in SDSS. The red and green ellipses denote the color-color 
distribution of galaxies and QSOs respectively. 
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Fig. 18. — (following page) Scatter plot of SDSS colors of variable objects detected but 
unclassified in SDSS. The dotted and dashed ellipses denote the color-color distribution of 
galaxies and QSOs respectively, identical to those in figure ITT! The open circles in the legend 
represent the numbers in the plot indicating the unclassified objects that fall outside of both 
ellipses. The numbering scheme is used for easy identification of objects that consistently 
fall outside of the QSO and galaxy ellipses. 
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Table 1. Summary of Properties of Variables & Transients 



Coordinates (J2()()()) 



/ (m.Ty) 



"^nbr Type Mag. 



Cross-ID 



Range 



<(<^) 



(days) 



01 05.881 +20 23 10.85 


5.91 


5.85 


2.56 


- 6.19 


4.28 


3 


01 07.863 +24 53 33.32 


8.78 


8.96 


3.04 


- 9.15 


8.08 


3 


01 35.629 +33 12 33.28 


4.75 


4.53 


<3.05 


- <9.02 


4.00 


6 


01 49.193 +03 39 03.44 


9.18 


9.05 


4.67 


- 9.58 


8.73 


3 


02 10.433 +16 33 28.02 


1.25 


1.28 


<1.05 


- <3.75 




4 


02 25.829 +30 37 38.12 


4.44 


4.46 


<2.47 


- <7.81 


3.76 


3 


02 40.649 +21 56 20.99 


8.40 


7.14 


6.76 - 


11.24 


9.19 


3 


02 49.673 +31 36 40.33 


2.51 


2.70 


<1.67 - 


<15.62 




6 


02 53.621 +34 31 29.70 


4.54 


4.47 


1.43 - 


<15.83 


4.61 


5 


02 55.734 +04 33 47.25 


2.23 


2.21 


1.39 


- 3.10 




3 


03 02.267 +03 37 53.99 


2.49 


2.60 


<1.66 


- 3.29 




4 


03 19.427 +04 38 15.57 


3.64 


3.58 


2.24 - 


<13.88 


3.39 


4 


03 57.217 -01 38 26.51 


2.20 


2.34 


<1.73 


- <7.93 




4 


04 28.315 +03 52 59.25 


5.38 


5.42 


<4.02 


- 7.42 




4 


04 39.877 +50 28 20.71 


2.32 


2.32 


1.43 


- 3.59 




3 


04 50.878 +29 20 33.40 


1.90 


1.66 


<1.40 


- 2.96 




3 


04 53.441 -04 59 41.50 


4.62 


4.52 


1.71 


- 4.70 




3 


05 26.547 +32 54 33.42 


3.53 


3.39 


1.80 


- 4.03 


4.21 


4 


05 51.716 +21 11 20.45 


10.99 


11.12 


6.29 - 


13.44 


8.62 


5 


06 06.530 +61 42 04.63 


23.40 


25.19 


24.94 


- 36.24 


25.18 


5 


06 08.024 +36 43 31.29 


3.11 


3.01 


<2.06 - 


<24.94 




6 


06 18.560 -07 36 45.19 


6.53 


7.24 


4.32 


- 8.39 


9.55 


2 


06 26.572 +00 09 31.41 


4.44 


4.35 


2.60 - 


10.80 


4.47 


4 


07 28.795 +25 30 18.66 


17.03 


16.69 


16.41 


- 38.70 


13.66 


6 


07 47.271 -06 08 57.51 


4.11 


4.08 


1.87 - 


<13.68 




4 


08 06.423 +36 53 12.45 


2.84 


2.75 


1.65 


- 5.35 


2.66 


3 


08 12.253 +33 13 22.10 


2.37 


2.61 


1.50 


- 4.22 


3.29 


4 



4.86 


5.18 


6.97 


7.27 


5.70 


5.61 


4.90 


5.22 


5.32 


4.06 


6.51 


6.39 


5.27 


5.27 


6.10 


5.48 


8.00 


7.09 


5.19 


3.65 


6.61 


5.99 


6.36 


5.13 


5.51 


5.28 


5.70 


4.92 


5.98 


4.80 


5.63 


4.35 


6.30 


6.49 


5.11 


5.32 


6.11 


6.53 


5.43 


3.92 


5.28 


5.33 


7.32 


6.93 


5.18 


4.98 


5.40 


4.93 


6.84 


6.63 


5.31 


4.85 


6.95 


5.35 



4.78 


2.33 


5 


6.50 


3.01 


2 


5.35 


>1.58 




4.64 


1.95 




5.25 


>2.02 




6.17 


>1.97 




5.28 


1.66 




5.70 


>1.83 


95 


6.79 


3.43 


5 


4.87 


>2.24 


388 


6.56 


>1.99 


1 


5.72 


>1.73 


388 


5.61 


>1.70 


1 


5.35 


>1.53 


1 


5.94 


2.50 


27 


5.92 


>2.12 


11 


5.94 


2.76 


40 


4.98 


2.24 


488 


5.50 


1.82 


5 


3.61 


1.41 





5.40 


>1.62 


5 


6.31 


1.94 





4.75 


1.86 


3 


5.18 


>1.96 





5.92 


2.87 


2 


5.64 


3.24 


6 


6.76 


2.81 


483 



0.09 


V 


0.75 


V 


1.53 


V 


0.19 


V 


0.23 


V 


0.65 


V 


3.17 


V 


0.95 


V 


0.68 


V 


0.26 


V 


0.57 


V 


0.12 


V 


1.20 


V 


0.62 


V 


0.10 


V 


1.05 


V 


1.43 


V 


0.44 


V 


0.23 


V 


0.18 


V 


0.41 


V 


0.25 


V 


0.93 


V 


0.06 


V 


0.96 


V 


0.10 


V 


0.35 


V 



21.24 SDSS-G 



19.73 SDSS-G 

21.28 SDSS-G 

19.39 SDSS-G 

20.14 SDSS-G 



19.26 SDSS-QSO(P) 



16.02 SDSS-G 



21.79 SDSS-G 



15.29 SDSS-G 



21.89 SDSS-G 



18.13 SDSS-G 
19.62 SDSS-G 



CO 
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Table 2. Summary of Cross-ID Statistics for Variables & Transients. 



Cross-ID 


Catalog 


Rank" 


AS {") 




Matches 


Match 


Rate 




N 


% of Variables 


Variable % % 


of FIRST sources'^ 


STAR 


HIPPARCOS 
TYCHO 


1 
2 


1.4 
1.4 


3 (3) 

4(2) 


0.18 (0.18) 
0.25 (0.12) 


27.3 (27.3) 
22.2 (22.2) 


0.004 (0.004) 
0.006 (0.003) 


PULSARS 


ATNF 


3 


3.0 


13 (13) 


0.8 (0.8) 


76.48 (76.48) 


0.006 (0.006) 


QSO 


SDSS (S)"* 


4 


1.4 


53 (53) 


3.26 (3.26) 


1.16 (1.16) 


1.632 (1.632) 


SDSS {P)° 


5 


1.4 


68 (67) 


4.18 (4.12) 


1.1 (1.09) 


2.203 (2.239) 




SDSS DR7 


6 


1.4 


442 (431) 


27.17 (26.49) 


0.55 (0.54) 


28.92 (29.68) 


GALAXY 


GSC-2f 

2MASSS 


7 
8 


1.4 
1.4 


349 (56) 
121 (1) 


21.45 (3.44) 
7.44 (0.06) 


f 
g 


f 
g 




APMUKS{BJ)'» 


13 


6.0 


2 (1) 


0.12 (0.06) 








SDSS DR7 


6 


1.4 


227 (114) 


13.95 (7.01) 


0.91 (0.75) 


8.9 (5.7) 




GSC-2f 


7 


1.4 


111 (4) 


6.82 (0.25) 


f 


f 




2MASS'S 


8 


1.4 


27 (0) 


1.66 (0.00) 


g 


g 


UNCLASSIFIED 


CHANDRA 


9 


3.0 


7(1) 


0.43 (0.06) 


1.45 (1.23) 


0.173 (0.048) 




XMM 


10 


15.0 


15 (2) 


0.92 (0.12) 


1.52 (1.03) 


0.353 (0.116) 




RASS-BSC 


11 


30.0 


15 (0) 


0.92 (0.00) 


2.29 (0.00) 


0.234 (0.011) 




RASS-FSC 


12 


60.0 


13 (2) 


0.8 (0.12) 


0.70 (0.49) 


0.664 (0.246) 




GB-87'' 


13 


6.0 


21 (7) 


1.29 (0.43) 








VLSS"* 


13 


6.0 


5 (2) 


0.31 (0.12) 








WB-92'' 


13 


6.0 


13 (1) 


0.8 (0.06) 






UNIDENTIFIED 


ABELL-04'> 
B3'' 


13 
13 


6.0 
6.0 


1 (1) 
1 (1) 


0.06 (0.06) 
0.06 (0.06) 








wn'' 


13 


6.0 


2 (1) 


0.12 (0.06) 








NVSS 


14 


7.1 


989 (487) 


60.79 (29.93) 


0.75 (0.62) 


47.46 (46.88) 




NONE 


15 


. . . i 


377 (377) 


23.17 (23.17) 


0.43 


■ ■ ■ (100.0) 


TOTAL 








1623+4'= 


100.0 







Rank indicates the order in which the matches arc made with different catalogs. 

Includes only the isolated point-like sources from the FIRST catalog 
"^Four pulsars arc slightly resolved but nevertheless included; more detailed data available in table [31 

Data obtained from SDSS DR,5 spectroscopic QSO sample 
^Data obtained from SDSS DR6 photometric QSO sample 

Has solely made use of the Guide Star Catalog II cross-match information available in the FIBST catalog 
^Has solely made use of the 2 Micron All-Sky Survey cross-match information available in the FIRST catalog 

Data obtained from the NASA Extragalactic Database 
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^Search radius varies depending on the catalog being searched 
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Table 3. Summary of the properties of pulsars. ° 



Coordinates (J2000) 



A (") PSR Name /^at (mJy) Period (s) DM (crn^^pc) Distance (kpc) 



Pulsar matches with the sample of outliers 



10 24 38.698 -07 19 19.07 

09 22 14.008 +06 38 22.84 

10 22 58.011 +10 01 52.85 
12 39 40.386 +24 53 49.87 
08 26 51.438 +26 37 22.83 
16 52 03.080 +26 51 39.85 
15 18 16.831 +49 04 34.19 
10 12 33.387 +53 07 02.09 
15 09 25.675 +55 31 32.90 



0.17 


J1024-0719 


5.21 


0.005 


6.49 


0.53 


0.51 


J0922+0638 


10.33 


0.431 


27.27 


1.20 


0.40 


J1022+1001 


3.69 


0.016 


10.25 


0.40 


1.19 


J1239+2453 


11.53 


1.382 


9.24 


0.86 


1.19 


J0826+2637 


11.14 


0.531 


19.45 


0.36 


0.56 


J1652+2651 


6.27 


0.916 


40.80 


2.93 


0.31 


J1518+4904 


5.03 


0.041 


11.61 


0.70 


0.66 


J1012+5307 


2.20 


0.005 


9.02 


0.52 


0.62 


J1509+5531 


10.02 


0.740 


19.61 


2.13 



Pulsar matches passing variability criteria but not strictly point-like 



16 07 12.078 -00 32 40.98 
10 23 47.622 +00 38 41.60 
09 43 30.092 +16 31 34.67 
16 40 16.699 +22 24 08.98 



0.42 J1607-0032 

1.09 J1023+0038 

2.32 J0943+1631 

0.60 J1640+2224 



3.93 


0.422 


10.68 


0.59 


3.12 


0.002 


14.32 


0.90 


1.51 


1.087 


20.32 


1.76 


1.92 


0.003 


18.43 


1.19 



Other pulsar matches in the FIRST survey 



09 53 09.287 +07 55 35.94 
15 43 38.837 +09 29 16.52 
07 51 09.148 +18 07 38.73 
11 15 38.456 +50 30 12.68 



0.38 J0953+0755 

0.26 J1543+0929 

0.17 J0751+1807 

0.67 J1115+5030 



83.22 


0.253 


2.96 


0.26 


6.17 


0.748 


35.24 


7.69 


1.42 


0.003 


30.25 


0.62 


1.00 


1.656 


9.20 


0.54 



^Data obtained from the ATNF Pulsar Catalog 



